Prediction models for clustered data: comparison of a random intercept and standard regression model
نویسندگان
چکیده
BACKGROUND When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. METHODS Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. RESULTS The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. CONCLUSION The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters.
منابع مشابه
Comparison of Artificial Neural Networks and Cox Regression Models in Prediction of Kidney Transplant Survival
Cox regression model serves as a statistical method for analyzing the survival data, which requires some options such as hazard proportionality. In recent decades, artificial neural network model has been increasingly applied to predict survival data. This research was conducted to compare Cox regression and artificial neural network models in prediction of kidney transplant survival. The prese...
متن کاملComparison of Artificial Neural Networks and Cox Regression Models in Prediction of Kidney Transplant Survival
Cox regression model serves as a statistical method for analyzing the survival data, which requires some options such as hazard proportionality. In recent decades, artificial neural network model has been increasingly applied to predict survival data. This research was conducted to compare Cox regression and artificial neural network models in prediction of kidney transplant survival. The prese...
متن کاملImpact of Health Research Systems on Under-5 Mortality Rate: A Trend Analysis
Background Between 1990 and 2015, under-5 mortality rate (U5MR) declined by 53%, from an estimated rate of 91 deaths per 1000 live births to 43, globally. The aim of this study was to determine the share of health research systems in this decrease alongside other influential factors. Methods We used random effect regression models including the ‘random intercept’ and ‘random intercept and ran...
متن کاملComparison of Artificial Neural Network and Regression Models for Prediction of Body Weight in Raini Cashmere Goat
The artificial neural networks (ANN) are the learning algorithms and mathematical models, which mimic the information processing ability of human brain and can be used to non linear and complex data. The aim of this study was to compare artificial neural network and regression models for prediction of body weight in Raini Cashmere goat. The data of 1389 goats for body weight, height at withers ...
متن کاملComparison of Methods for Clustered Data Analysis in a Non-Ideal Situation: Results from an Evaluation of Predictors of Yellow Fever Vaccine Refusal in the Global TravEpiNet (GTEN) Consortium
Not accounting for clustering in data from multiple centers might yield biased estimates and their standard errors, potentially leading to incorrect inferences. We fit 15 different models with different correlation structures and with/without adjustment for small clusters, including unadjusted logistic regression, Population-averaged models (Generalized Estimating Equations), Cluster-specific m...
متن کامل